CDS
Accession Number | TCMCG074C30257 |
gbkey | CDS |
Protein Id | KAF8411864.1 |
Location | complement(join(47602297..47602480,47602719..47602796,47608849..47608940,47615032..47615185,47615710..47616032,47619237..47619657,47620741..47621294)) |
Organism | Tetracentron sinense |
locus_tag | HHK36_004423 |
Protein
Length | 601aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA625382, BioSample:SAMN14615867 |
db_source | JABCRI010000002.1 |
Definition | hypothetical protein HHK36_004423 [Tetracentron sinense] |
Locus_tag | HHK36_004423 |
EGGNOG-MAPPER Annotation
COG_category | O |
Description | Protein of unknown function (DUF3752) |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0001664
[VIEW IN EMBL-EBI] GO:0003674 [VIEW IN EMBL-EBI] GO:0005102 [VIEW IN EMBL-EBI] GO:0005488 [VIEW IN EMBL-EBI] GO:0005515 [VIEW IN EMBL-EBI] GO:0050780 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGAATTTAGTTGGAATCTCGCCCGATATCGCCACACACTCCATCTTGATCAATTGCTTCTGCGGCTTGCGTCGGGTGGATTTCGGTTTCTCCGTATTAGGTAGCATCTTGAAACGTGGTTATGCACCAAATGTAGTAACCTTCACCACTCTAATTAAGGGGCTCTGTGCCGAGGATAGAATCATTCAAGGTGTGGAATTGTTGAACAAAATGGCAGAGACTGAATATCAACCTAATATAATTACATATGGAACTATAATTAACTGGCTTTGCAAAACGGGGAACGCTGGCACGGCTATGAGGGTGCTTAGGAAGATGGAGAAAGGAAGTTGTAAACCTAGCTTAGTAGTTTATAGCATGATCATCGACAGTCTGTGCAAGGATAGATTGGTAACTAAGGCTTTGGAGCTACTCTCGGAAATGACCGGTAAAGGCATTCAACCAGATGTTGTGACCTACAGCTGTTTGATAGATGGCCTATACAGTTCAGGTCGGTGGAAAGAAGCTACAAGATTGTTTAATGAAATGAGGAAGATTAAGGAAGTCCCAAATTGTCGTACCAGTAAAAGCATTAGAAGTAAAAGGAAGCATTTGGAAGAGACAAGTAGTTCATCTTCATCACAATCATCATCATCAGACCCTGACAGTGAAAGAAGCCCTGAGAAGAGATATAGTTCCCATCGTCATCGAGAAGATAGGCACAAAAGCATTAATGGTTCTAAAAAGGAGAAGGAGAAGAATGAGAGAACAGGAAAAAGCAGAAATAAGAAAGAAAGGCAGGATAGGAAGGGTAAACAAAGAAGGGACCCAAAACGAAGATCGCGGCGAGAGGATAGAAGGCGGTTGGCTAAGACCAATGATACCGAATCTTCGGACGATGATCACTTGGAGCCGTCGAAATCTCGAAACAAACCGGAGACTATCCTTCGATACATCTTGAAAACATTTCTCAATGTTGGTGATGATTTGAAACAGCTTCTAAAAATGATCGATGATGGGCAAGCTGTTGACACAAGGGGAATATCTAACAGATCTTTGGTTAAGCATCTGAAGAAGCTTTTCCTATCCTTAAACCTTAAGGAAAACGATGATGGTATTTTCTTACTGCCTCCGAAGGTTCGTCCAACTTTGGAGGTTGTTGGGCCTATGATTTGCTCGCATTTAAGACCCAAAGACCAGCAGTTTTCTAATTCTGCATCAGCAAACGTTATGGAGTCTATACCATTGGATGCAGATAGTAAAAAAATGATAGATGACAATCAATTGACAAAAGATGATGCTTCTGCTCCTAAACGAAGGGCATCAAGGAATAGTTATTCTTATCGTGGGGCAATGGACCTTACGGGACTAGGGAAGATTTCTCCAAAGAATTCTTCCTATTTGTTATATGTTGGAGTATTGGAAAGAACGAAGTACGATAATCCAAAAGACTATAGCCGCCCTATACTGGAGGCTGCTGTCGAGGATCCAAATGGTACAAGCCTCCACAATTTTGCTCTCTTTCAAGTGTCTTCCTTGTCTTCAGGTCTTGAAATGAATAAAGGGGTGATTGGTCCTGAAATGCCATCTTCAGAGTTACTTGCTGCAGCAGCTAAATTAACAGAAGCAGAATCTTTGCTGAGAGAAGCTGAGTTGGAGAATGACACTGAAATATTTATAGGTCCTCCACCGCCTGCTGTGGTTGCTGAAGCTGAGTCAGCAAATGACGCAGAGCGCTTTGAAGAGGTTCTTAACTCTATCCATTATACAACGCTGATGAATTTGTTTAAGAATTATCTTCTTCCTTGTGTACATAATGTTAGTTGA |
Protein: MNLVGISPDIATHSILINCFCGLRRVDFGFSVLGSILKRGYAPNVVTFTTLIKGLCAEDRIIQGVELLNKMAETEYQPNIITYGTIINWLCKTGNAGTAMRVLRKMEKGSCKPSLVVYSMIIDSLCKDRLVTKALELLSEMTGKGIQPDVVTYSCLIDGLYSSGRWKEATRLFNEMRKIKEVPNCRTSKSIRSKRKHLEETSSSSSSQSSSSDPDSERSPEKRYSSHRHREDRHKSINGSKKEKEKNERTGKSRNKKERQDRKGKQRRDPKRRSRREDRRRLAKTNDTESSDDDHLEPSKSRNKPETILRYILKTFLNVGDDLKQLLKMIDDGQAVDTRGISNRSLVKHLKKLFLSLNLKENDDGIFLLPPKVRPTLEVVGPMICSHLRPKDQQFSNSASANVMESIPLDADSKKMIDDNQLTKDDASAPKRRASRNSYSYRGAMDLTGLGKISPKNSSYLLYVGVLERTKYDNPKDYSRPILEAAVEDPNGTSLHNFALFQVSSLSSGLEMNKGVIGPEMPSSELLAAAAKLTEAESLLREAELENDTEIFIGPPPPAVVAEAESANDAERFEEVLNSIHYTTLMNLFKNYLLPCVHNVS |